Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Search: Advanced Search
Anonymous User (login or join us)
Upload

Reply to this post | See parent post | Go Back
View Post [edit]

Poster: Web-Designer-2008 Date: Nov 13, 2008 9:58am
Forum: web Subject: Re: Please delete the following sites

Thank you, I just checked it. Robots.txt does work.

The only question on the response when I query my old site:

"We're sorry, access to http://zzz.com/* has been blocked by the site owner via robots.txt."

Does this mean that even though the access to the site archive is blocked, the old archive is still stored at Archive.org?

Thank you.

Reply to this post
Reply [edit]

Poster: kustota Date: Nov 13, 2008 6:37pm
Forum: web Subject: Re: Please delete the following sites

as i understand, when there is a robots.txt block, your site is not crawled and becomes inaccessible in wayback machine. i guess the old versions of the site (before the block) are stored somewhere in alexa's archives. i might be wrong.

Reply to this post
Reply [edit]

Poster: Face_ Date: Nov 13, 2008 10:29am
Forum: web Subject: Re: Please delete the following sites

Ehm... I don't get that message when I check www.zzz.com. I see quite some copies:

http://web.archive.org/web/*/http://www.zzz.com/

The site also currently does not seem to have a robots.txt (www.zzz.com/robots.txt), even though it once had one (web.archive.org/web/*/http://www.zzz.com/robots.txt).

vukovar.com seems to be blocked now (web.archive.org/web/*/http://www.vukovar.com), but dalmatia.net seems not(web.archive.org/web/*/http://www.dalmatia.net).

As for your question, well, good question! Does kustota know the answer?

Reply to this post
Reply [edit]

Poster: Web-Designer-2008 Date: Nov 13, 2008 9:09pm
Forum: web Subject: Re: Please delete the following sites

Thank Face_ and kustota.

zzz.com was just an example, not a real site. Maybe it is :-)

It would be interesting to know for sure if there is any way for the owner to ensure that IA deletes all his archives on request, not just blocks them.



Reply to this post
Reply [edit]

Poster: kustota Date: Nov 13, 2008 9:55pm
Forum: web Subject: Re: Please delete the following sites

i don't think there is a reason to prevent some third party to keep archived copies of your site. i don't even see any legal grounds that will allow you to prevent them do it.
and what if your site changes hands? do you want the new owner to have such rights, to erase all of your work completely?

Reply to this post
Reply [edit]

Poster: protheus Date: Dec 24, 2008 5:33am
Forum: web Subject: Re: Please delete the following sites

@kustota:
" ... and what if your site changes hands? do you want the new owner to have such rights, to erase all of your work completely?"

A person who sends a deletion request wants JUST THAT: to erase all of his work from your archive. ;-)

As soon as the requester´s identity is verified, you should DELETE all concerning data, not only block access.


This post was modified by protheus on 2008-12-24 13:33:41